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Abstract 

Complex networks from such different fields as biology, technology or sociology share 
similar organization principles. The possibility of a unique growth mechanism for a 
variety of complex networks in different fields such as biology, technology or sociology 
is of interest, as it promises to uncover the universal origins of collective behavior. 
The emergence of self-similarity in complex networks raises the fundamental question 
of the growth process according to which these structures evolve. Here, we use the 
concept of renormalization as a mechanism for the growth of fractal and non-fractal 
modular networks. We show that the key principle that gives rise to the fractal 
architecture of networks is a strong effective "repulsion" (disassortativity) between 
the most connected nodes (hubs) on all length scales, rendering them very dispersed. 
More importantly, we show that a robust network comprised of functional modules, 
such as a cellular network, necessitates a fractal topology, suggestive of an evolutionary 
drive for their existence. 
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An important result in statistical physics was the generation of fractal geometries by 
Mandelbrot , the structures of which look the same on all length scales. Their im- 
portance stem from the fact that these structures were recognized in numerous examples in 
Nature, from snowflakes and trees to phase transitions in critical phenomena 0,0]. While 
these fascinating patterns are only geometric, new forms of topological fractality have been 
observed in complex networks 14] where the links rely on interactions between the partici- 
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pants [5|, [6|. Examples of topological fractal networks include the hyperlinks in the WWW, 
physical interactions in protein interaction networks or biochemical reactions in metabolism 
4,Q]. Other complex networks such as the Internet do not share the topological fractal 
property. 

These fractal complex networks are characterized by the small-world property (as given 
by the logarithmic dependence of the average distance with the number of nodes) brought 
about by the "short-cuts" in the network lal, a very wide (power-law or "scale-free" |9||) 
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distribution of connections, and a modular hierarchical structure [10, HJJ, 112, . However, 
;he fractal sets of Mandelbrot do not exhibit these features. While in our previous work 
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we discovered the fractal nature of organization in many real networks the question 
remained how these networks have evolved in time. We therefore launch a study of growth 
mechanisms to understand the simultaneous emergence of fractality, modularity, as well as 
the small world effect, and the scale- free property in real world complex networks. Our 
results have important evolutionary implications. They highlight an evolutionary drive 
towards fractality, inspired by an increase in network robustness. Thus, a robust modular 
network requires a fractal topology. Furthermore, our analysis indicates that the fractal 
clusters can be identified with the functional modules in the case of the metabolic network 
of E. coli. 

The "democratic" rule of the seminal Erdos-Renyi model Q| (where the nodes in the 
network are connected at random) was first invoked to explain the small world effect. It was 
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then replaced by the "rich-get-richer" principle of preferential attachment |2( to explain the 
scale-free property; a discovery carrying important implications on network vulnerability 
rlowever these rules do not capture the fractal topologies found in diverse complex 
networks. We find that models of scale-free networks are not fractals (see Supplementary 
Materials Section |1J). Here, we demonstrate a new view of network dynamics where the 
growth takes place multiplicatively in a correlated self-similar modular fashion, in contrast 



2 



to the uncorrelated growth of models of preferential attachment 

We formalize these ideas by borrowing the concept of "length-scale renormalization" from 
critical phenomena JjJ. In this paper, we will show that the emergence of self-similar fractal 
networks, such as cellular ones, is due to the ,^^(^^0) between 
the hubs at all length scales. In other words, the hubs prefer to grow by connections to less- 
connected nodes rather than to other hubs, an effect that can be viewed as an effective hub 
repulsion. In this new paradigm, the "rich" still get richer, although at the expense of the 
"poor" . In other words, the hubs grow by preferentially linking with less-connected nodes to 
generate a more robust fractal topology. In contrast, weakly anticorrelated or uncorrelated 
growth leads to non- fractal topologies such as the Internet. 

Growth mechanism. — The renormalization scheme tiles a network of N nodes with 
N B (£ B ) boxes using the box-covering algorithm as shown in Fig. [TJi. The boxes contain 
nodes separated by a distance £ B , measured as the length of the shortest path between 
nodes. Each box is subsequently replaced by a node, and the process is repeated until the 
whole network is reduced to a single node. The way to distinguish between fractal and 
non-fractal networks is represented in their scaling properties as seen in Fig. and Eb- 
Fractal networks can be characterized by the following scaling relations (Fig. EK) : 

N B (£ B )/N ~T B iB and k B (£ B )/k hub ~ £~ d \ (1) 

where k huh and k B (£ B ) are the degree of the most connected node inside each box and that 
of each box respectively (Fig. [T^,). Although both of them are partial variables, the ratio 
between them is a global quantity, depending only on the length scale £ B , as we showed in 
J|. The two exponents d B and dj- are the fractal dimension and the degree exponent of the 
boxes, respectively. While the term "fractal dimension" is usually reserved for geometrical 
self-similarity, here we relax the usage to include the topological self-similarity as well. For 
a non-fractal network like the Internet (Fig. Eb), we have d B — > oo and df- — > oo; the scaling 
laws in Eq. ([TJ are replaced by exponential functions. 

Based on the results leading to Eq. (JT|), we propose a network growth dynamics as the 
inverse of the renormalization procedure. Thus, the coarse-grained networks of smaller size 
are network structures appearing earlier in time, as exemplified in Fig. QJi. A present time 
network with N(t) nodes is tiled with N B (£ B ) boxes of size £ B . Each box represents a node 
in a previous time step, so that N(t — 1) = N B (£ B ). The maximum degree of the nodes 



inside a box corresponds to the present time degree: k(t) = kh u b, which is renormalized such 
that k(t — 1) = ks^B)- The tilde over the quantities are needed in order to differentiate 
the dynamical quantities, such as the number of nodes as a function of time, N(t), from 
the static quantities, such the number of nodes of the present network, N, or the number 
of nodes of the renormalized network, Nb- The renormalization procedure applies to many 
complex networks in Nature |4|. These includes fractal networks such as WWW, protein 
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interaction networks of E. coli, the yeast and human, and metabolic networks of 43 
different organisms from the three domains of life, and some sociological networks. The 
renormalization scheme can be applied to non- fractal networks, such as the Internet, as well. 
Below we will show that the main difference between these two groups is in the connectivity 
correlation. We also provide empirical, analytical and modelling evidences supporting this 
theoretical framework based on the validity of exponents, scaling theory, and statistical 
properties of the connectivity correlation. 

Correlation. — A question of importance to elucidate the selection rules governing the 
fractality of the network is to determine how the nodes in older networks are connected to 
those of the present day. The answer lies in the statistical property of correlation between 
the nodes and boxes within a network configuration. Studying the correlation profile in 
real networks similar to those considered in Ilfli provides initial hints to the above 
question. The correlation profile [3] compares the joint probability distribution, P{k\, k 2 ), of 
finding a node with k\ links connected to a node with k 2 links with their random uncorrelated 
counterpart, P r (ki,k2), which is obtained by random swapping of the links, yet preserving 
the degree distribution. A plot of the ratio R(ki, k 2 ) = P(k±, k 2 )/P r {k\, k 2 ) provides evidence 
of correlated topological structure that deviates from the random uncorrelated case. 

At first glance, a qualitative classification based on the strength of the anticorrelation 
of different networks can be obtained by normalizing the ratio R(k±, k 2 ) to that of a given 
network, for instance the WWW [21], (Supplementary Materials, Section UTJ) . Figure Et 
and I2H show the correlation profiles of the cellular metabolic network of E. coli U3i , which 
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is known to be fractal, and the Internet at the router level [23J, which has a non- fractal 
topology. The fractal network poses a higher degree of anticorrelation or disassortativity; 
nodes with a large degree tend to be connected with nodes of a small degree. On the other 
hand, the non-fractal Internet is less anticorrelated. Thus, fractal topologies seem to display 
a higher degree of hub repulsion in their structure than non-fractals. However, for this 
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property to be the hallmark of fractality, it is required that the anticorrelation appears not 
only in the original network (captured by the correlation profiles of Fig. Efc and Fig. |2tl)- 
but also in the renormalized networks at all length scales. We note that other measures of 
anti—on, such a8 the Pearson coefficient r of the deuces at the end of an ed g e Q, 
cannot capture the difference between fractal and non-fractal network. We find that r is not 
invariant under renormalization. 

Mathematical model. — To quantitatively link the anticorrelation at all length scales 
to the emergence of fractality, we next develop a mathematical framework and demonstrate 
the mechanism for fractal network growth. In the case of modular networks, stemming from 
Eqs. (JTJ), we require that 

N(t) =nN(t-l), 
k{t) = sk{t- 1), (2) 
L(t) + L = a(L(t-l) + L ), 

where n > 1, s > 1 and a > 1 are time- independent constants and L{t) is the diameter of 
the network defined by the largest distance between nodes. The first equation is analogous 
to the tnnltiphcative process naturauy fonnd in tnany popniatton gn3 wth systems Q. The 
second relation is analogous to the preferential attachment rule It gives rise to the 
scale-free probability distribution of finding a node with degree k, P{k) ~ /c~ 7 . The third 
equation describes the growth of the diameter of the network and determines whether the 
network is small- world Q and/or fractal. Here we introduce the characteristic size Lq, 
the importance of which lies in describing the non-fractal networks. Since every quantity 
increases by a factor of n, s and a, we first derive (Supplementary Materials Section HV)) the 
scaling exponents in terms of the microscopic parameters: ds = Inn/ In a, dk = Ins/ In a. 
The exponent of the degree distribution satisfies 7 = 1+lnn/ Ins. The dynamics represented 
by Eqs. © consequently leads to a modular structure where modules are represented by 
the boxes. While modularity has often been identified with the scaling of the clustering 
coefficient ^j], here we propose an alternative definition of "modular network" as the one 
whose statistical properties remain invariant (in particular, an invariant degree distribution 
with the same exponent 7, see Supplementary Materials Section liTT|l under renormalization. 

In order to incorporate different growth modes in the dynamical Eqs. (j2J) we consider, 
without loss of generality, two modes of connectivity between boxes, whose relative frequen- 



cies of occurrence are controlled by the probability e representing the hub-hub attraction, (i) 
Mode I with probability e (Fig. \3p): two boxes are connected through a direct link between 
their hubs leading to hub- hub attraction, (ii) Mode II with probability 1 — e (Fig. ^): two 
boxes are connected via non-hubs leading to hub-hub repulsion or anticorrelation. We will 
show that Mode I leads to non-fractal networks while Mode II leads to fractal networks. In 
practice, though Eqs. © are deterministic, we combine these two modes according to the 
probability e, which renders our model probabilistic. 

Formally, for a node with k(t — 1) links at time t — 1, we define n^(t) as the number of 
links which are connected to hubs in the next time step (see Fig. ^i). Then the probability 
e satisfies: 

n h (t) =e fc(t-l). (3) 

Using the analogy between time evolution and renormalization, we introduce the cor- 
responding quantity, n h (£ B ), and defines the ratio S(£b) = ^{Ib) I^b{^b)- The nonlinear 
relation between t and £b leads to the £b dependence on £ (see Supplementary Materials, 
Section HV]) . In the extreme case of strong hub attraction, where the hubs of the boxes are 
connected at all length scales, we have £(Ib) ~ constant. On the other hand, hub repulsion 
leads to decreasing £{£b) with £g. From scaling we obtain a new exponent d e = — In e/ In a 
characterizing the strength of the anticorrelation in a scale-invariant way: 

S(l B )~t^. (4) 

Fig. |2t shows £(£b) for two real fractal and non- fractal networks: a map of the WWW 



domain (nd.edu) consisting of 352,728 web-sites 2l( and a map of the Internet at the router 
level consisting of 284,771 nodes |23|. We find that for the fractal WWW, d e = 1.5, indicating 
that it exhibits strong anticorrelation. On the other hand, the non-fractal Internet shows 
£((-b) ~ constant. 

These results confirm that fractal networks, including the protein interaction network 



2^] (with d e = 1.1) and the metabolic network of E. coli [22j (with d e = 4.5), do have strong 
hub repulsion at all length scales and non-fractal networks have no or weak hub repulsion. 

A general limitation when analyzing the scaling behavior of complex networks is the 
small range in which the scaling is valid. This is due to the small-world property that 
restricts the range of £b in Fig. As an attempt to circumvent this limitation, we offer 
not only the empirical determination of the exponents but also scaling theory and models 



where the exponents can be further tested. We should also point out that large exponents 
(such as d e — 4.5 for E. coli) may not be distinguishable from exponential behavior (infinite 
exponent). In this case, however the large exponent d e for E. coli agrees with our theoretical 
framework, since it corresponds to a network with large anticorrelation in the connectivity 
and the subsequent small fractal dimension. In terms of the model, this corresponds to the 
limit of e — > 0. 

Next we show how the different growth modes reproduce the empirical findings. While 
each mode leads to the scale-free topology, they differ in their fractal and small-world prop- 
erties. Mode I alone (e = 1) exhibits the small-world effect, but is not fractal due to its 
strong hub-hub attraction (see Fig. |TJd). On the other hand, Mode II alone (e = 0, Fig. [TJj) 
gives rise to a fractal network. However, in this case, the anticorrelation is strong enough to 
push the hubs far apart, leading to the disintegration of the small- world. Full details of the 
implementation of Mode I and Mode II are given in the Supplementary Materials Section 
ITVAl andlVl 

These results suggest that the simultaneous appearance of both the small-world and 
fractal properties in scale-free networks is due to a combination of the growth modes. In 
general, the growth process is a stochastic combination of Mode I (with probability e) and 
Mode II (with probability 1 — e). For the intermediate (0 < e < 1), the model predicts finite 
fractal exponents ds and dk and also bears the small-world property due to the presence of 
Mode I. Such a fractal small- world and scale-free network is visualized in Fig. EH for e = 0.8. 
Supporting evidences are given by (i) Fig. Et>, which shows that the model with e = 0.8 is 
more anticorrelated than the e = 1 model (Mode I), (ii) Fig. Et, which shows the power law 
dependence of N B on £ B for the fractal structure (e = 0.8), and the exponential dependence of 
the non-fractal structure (e = 1), and (Hi) Fig. Ell showing that Mode I reproduces S(£b) ~ 
constant while the e = 0.8 model gives £(£b) 

~ Eg e , which is in agreement with the empirical 
findings of Fig. |2H on real networks (the exponent d e = — In 0.8/ In 1.4 = 0.66 is predicted 
by the analytical formula according to Supplementary Material II Vj) . Furthermore, in the 
Supplementary Materials Section \l V Al we show that the predicted scale-free distribution is 
invariant under renormalization. Although simplistic, this minimal model clearly captures 
an essential property of networks: the relationship between anticorrelation and fractality 
(see Methods for more details). We have also considered the contribution of loops, which 
we find does not change the general conclusions of this study. 
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Modularity. — The scale-invariant properties naturally lead to the appearance of a 
hierarchy of self-similar nested communities or modules. In this novel point of view, boxes 
represent nested modules of different length scales. The importance of modular structures is 



stressed in biologica 
put to the test [K 



networks, where questions of function and evolutionary importance are 
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. The relevant question is whether the self-similar hierarchy 
of boxes encodes the information about the functional modules in biological networks. To 
answer this question we ana, yZ e the fraetal m e t abolie network of E. coU Q which has 
been previously studied using standard clustering algorithms jll|. Here we show that by 
repeatedly applying the renormalization we produce a tree with branches that are cl osely 
related to the biochemical annotation, such as carbohydrates, lipids, amino acid, etc [11.]. 
We renormalize the network at a given box size and cluster the substrates which belong to 
the same box and repeat the procedure to generate the hierarchical tree shown in Fig. H^,. 
In Fig. (the right-bottom scheme), we see a subnet of the original metabolic network 
with 14 nodes. They correspond to the bottommost layer of the hierarchical tree in the left. 
The box covering with £g = 3 indicates that this subnet contains four modules. The coarse- 
grained network is shown in the right-middle with 4 nodes: A, B, C and D. The next stage 
of renormalization combines these four nodes to one single node or class. Following this 
algorithm, we coarse-grain the network and classify the nodes at different levels. In Fig. 5a, 
we show this classification for the entire metabolic network. The different colors correspond 
to distinct functional modules, as we annotate in the bottom of the tree (carbohydrates, 
lipids, etc.). The clear division of biological functions in the hierarchical tree suggests that 
the metabolic network is organized in a self-similar way. 

The main known biochemical classes of the substrates emerge naturally from the renor- 
malization tree, indicating that the boxes capture the modular structure of the metabolic 
network of E. coli. The same analysis reproduces the modular structure of the protein 
interaction network of the yeast further suggesting the validity of our analysis [lsj ]. 

Robustness. — Finally our results suggest the importance of self-similarity in the evo- 
lution of the topology of networks. Understanding the growth mechanism is of fundamental 
importance as it raises the question of its motivation in Nature. For instance, given that 
systems in biology are fractal, there could be an evolutionary drive for the creation of such 
networks. A parameter relevant to evolution is the robustness of the network, which can be 
compared between fractal and non-fractal networks. 
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Non- fractal scale-free networks, such as the Internet, are extremely vulnerable to tar- 
geted attacks on the hubs 15]. In such non-fractal topologies, the hubs are connected and 
form a central compact core (as seen in Fig. |2b), such that the removal of few largest hubs 
(those with the largest degree) has catastrophic consequences for the network [2^ . Here 
we show that the fractal property of networks significantly increases the robustness against 
targeted attacks since the hubs are more dispersed in the network (see Fig. |2k). Figure 
|3t shows a comparison of robustness between a fractal and non-fractal network. The com- 
parison is done between model networks of the same 7 = 2.8, the same number of nodes 
(74,000), the same number of links, the same amount of loops and the same clustering co- 
efficient (see Supplementary Materials Section IVI) . Thus the difference in the robustness 
seen in this figure is attributed solely to the different degree of anticorrelation. We plot 
the relative size of the largest cluster, S, and the average size of the remaining isolated 
clusters, (s), after removing a fraction / of the largest hubs for both networks [15]. While 
both networks collapse at a finite fraction f c , evidenced by the decrease of S toward zero 
and the peak in (s), the fractal network has a significantly larger threshold (f c ~ 0.09) com- 
pared to the non-fractal threshold (f c ~ 0.02) suggesting a significantly higher robustness 
of the fractal modular networks to failure of the highly connected nodes. This could explain 
why evolutionary constraints on biological networks have led to fractal architectures. It is 
important to note that the comparison in Fig. is between two networks which preserve 
the modularity. Our results should be understood as follows: given that a network has a 
modular structure, then the most robust network is the one with fractal topology. There 
are other ways to increase robustness by, for instance, fully connecting the hubs in a central 
core 2^1, but this arrangement does not preserve the modularity. 

Summary. — We find that the statistical properties of many real networks are well 
consistent with the predictions of the proposed multiplicative model. Networks that can be 
captured by our theoretical framework include fractal networks (WWW, protein interactions, 
metabolic networks and some collaboration networks) and the non-fractal networks such 
as the Internet. The validity of the proposed framework is supported by the predicted 
scaling exponents (cIb, dk, d e ) in many real networks as well as the general properties of 
the connectivity distribution captured by P(ki,k 2 ) and the scaling relationships predicted 
by the multiplicative growth process of our model. Our results demonstrate that nodes 
are organized around dispersed hubs in self-similar nested modules characterized by 
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different functionalities. These then compartmentalize the hubs jl^ , and protect them from 
a failure at the system level 2f|. Hence, these modules function relatively autonomously 
so that a failure in one module cannot propagate easily to the next. This may provide a 
significantly higher protection against intentional attacks reducing the high vulnerability — 
the Achilles' heel — of non-fractal scale-free networks. 
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METHODS 

Details of the Mode I and Mode II of growth in the minimal model. — Mode 
I: To each node with degree k(t — 1) at time t — 1, mk(t — 1) offspring nodes are attached 
at the next time step (m = 2 in the example of Fig. QJd. As a result we obtain a scale-free 
non-fractal network: N B {l B )/N ~exp(— ^£ B ) and k B {l B )/kh u b ~ exp(— ^f£ B ), implying 
that both exponents d B and dk are infinite (since a — > 1 then d B = Inn/ In a — > oo and 
dk = Ins/ In a — > oo). This is a direct consequence of the linear growth of the diameter 
Lit). Moreover, the additive growth in the diameter with time implies that the network is 
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small- world. This mode is similar to a class of models called pseudo-fractals |27L 1281] . Mode 
II: It gives rise to a fractal topology but with a breakdown of the small-world property. 
The diameter increases multiplicatively leading to an exponential growth with time, and 
consequently to a fractal topology with finite d B and du- 
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FIG.[0 Self-similar dynamical evolution of networks, (a) The dynamical growth process 
can be seen as the inverse renormalization procedure with all the properties of the network 
being invariant under time evolution. In this example N(t) = 16 nodes are renormalized 
with Nb(£b) — 4 boxes of size 1% = 3. (b) Analysis of Mode I, only: the boxes are connected 
directly leading to strong hub-hub attraction or assortativity. This mode produces a scale- 
free, small-world network but without the fractal topology, (c) Mode II alone produces a 
scale- free with a fractal topology but not the small- world effect. Here the boxes are connected 
via non-hubs leading to hub-hub repulsion or disassortativity. 

FIG. El Empirical results on real complex networks, (a) Schematics showing that fractal 
networks are characterized by a power law dependence between Nb and Ib while (b) non- 
fractal networks are characterized by an exponential dependence, (c) Plot of the correlation 
profile of the fractal metabolic network of E. coli, -Re. co k(^i, k^) / Rwww(ki, fa), and (d) the 
non-fractal Internet Ri n t(ki, fa)/-Rwww(fa> fa)> compared with the profile of the WWW in 
search of a signature of fractality. (e) Scaling of £(£b) as defined in Eq. (JH) for the fractal 
topology of the WWW with d e = 1.5, and the non-fractal topology of the Internet showing 
that fractal topologies are strongly anticorrelated at all length scales. In order to calculate 
£ (and in all the calculations in this study) we tile the network by first identifying the nodes 
which are the center of the boxes with the largest mass and sequentially centering the boxes 
around these nodes. 

FIG. El Predictions of the renormalization growth mechanism of complex networks, 
(a) Resulting topology predicted by the minimal model for e = 0.8, n = 5, a = 1.4, 
s = 3 and m — 2. The colors of the nodes show the modular structure with each color 
representing a different box. We also include loops in the structure as discussed in the 
Supplementary Materials, Section IVT1 (b) Ratio -R e =i(fa, fa)/-Re=o.s(fa, fa) to compare the 
hub-hub correlation emerging from the model networks generated with e = 1 and e = 0.8, 
respectively, (c) Plot of Nb versus is showing that Mode I is non-fractal (exponential 
decay) and e = 0.8 is fractal (power-law decay) according to (b) and in agreement with the 
empirical results of Fig. El (d) Scaling of £(£b) reproducing the behavior of fractal networks 
for e = 0.8 and non-fractal networks Mode I, e = 1, as found empirically in Fig. Et- 

FIG. HI Practical implications of the renormalization growth approach and fractality. 
(a) Renormalization tree of the metabolic network of E. coli leading to the appearance of 
the functional modules. The colors of the nodes and branches in the tree denote the main 
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biochemical classes as: carbohydrates, lipids, proteins, peptides and aminoacids, nucleotides 
and nucleic acids, and coenzymes and prosthetic groups biosynthesis (grey), (b) Details of 
the construction of three levels of the renormalization tree for t B = 3 for 14 metabolites in 
the carbohydrate biosynthesis class as shown in the shaded area in (a), (c) Vulnerability 
under intentional attack of a non-fractal network generated by Mode I (e = 1) and a fractal 
network generated by Mode II (e = 0). The plot shows the relative size of the largest cluster, 
S, and the average size of the remaining isolated clusters, (s) as a function of the removal 
fraction / of the largest hubs for both networks. 
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SUPPLEMENTARY MATERIAL 



The Supplementary Material section is organized as follows: Section |1] shows that the 
available scale-free models are non-fractals. Section El describes more results on the correla- 
tion supplementing the various analysis and calculations in the main text . Section 11111 reveals 
that the statistical properties, in particular, the degree distribution keeps invariant under 
renormalization for real- world networks. Section HVl explains in detail the various theoretical 
results presented in the main text and elaborates on the extensions of the minimal model 
of fractal growth. Finally, Section IV1I gives more details of the study of the robustness of 
fractal and non-fractal networks under intentional attack. 



I. STUDY OF SCALE-FREE MODELS 



While the origin of the scale-free property can be reduced to two basic mechanisms: 
rowth and preferential attachment, as exemplified by the Barabasi- Albert model (BA model 
J]), the empirical result of fractality cannot be explained only in those terms. Notice that 
the term "scale- free" coined by Barabasi-Albert [l| refers to the absence of a typical number 
of links, as exemplified by a power-law distribution of degree connectivity, but it does not 
refer to the length scale invariance found in jj]. 

We find that all models of scale-free networks such as the BA model of preferential 
attachment tat], the hierarchical model IllL and the so-called pseudo fractal models and 

n n 

trees |27|, |28( are non-fractals. In Fig. El we plot the number of boxes Nb versus £b for the 
models showing that in all the cases the decay of Nb(£b) is exponential or faster, indicating 
either an infinite ds or not a well-defined fractal dimension. 

In the present study we find the relation 7 = 1 + In n/ In s, by using ds = In n/ In a, and 
dk = Ins/ In a, as explained in the text. However, non-fractal networks satisfy this relation 
as well despite the infinite fractal dimension ds — > 00. Thus in general we say that when 
7 = 1 + lnn/ In s is satisfied, then the degree distribution is invariant under renormalization. 
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FIG. 5: Test of fractality in scale-free models. Number of boxes versus size of the boxes of different 
models showing that they do not follow a power-law, but and exponential (or faster) function as 
shown in the insets. Thus, they are non-fractal, (a) The BA model of preferential attachment 



(b) the hierarchical model ll| and (c) the model of Jung, Kim, and Kahng ( JKK model) |28| which 
is an example of pseudo fractal models as discussed by Dorogovtsev and Mendes [2^ . 



II. THE RANDOM UNCORRELATED SCALE-FREE MODEL AND THE COR- 
RELATION PROFILES 



It is instructive to analyze the degree of correlation in networks considering the deviations 
of the joint probability distribution P(k\, k 2 ) from the random uncorrelated scale-free case 
P r (ki,k2). This latter model is obtained by, for instance, random swapping of the links 
in a given network 119], so that the degree distribution is preserved, but the correlation is 
completely lost. 

The study of the ratio R(ki, k 2 ) = P{k\, k 2 )/P r (ki, k 2 ) reveals that most of the networks 
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such as metabolic and protein interaction networks, the Internet and WWW are anticorre- 
lated in comparison with the uncorrelated random case. This is because, even though this 
model is uncorrelated, there is still an effective attraction between the hubs since there is 
a large probability to randomly connect two nodes with large degrees. Thus, a plot of the 
ratio R = P(k\, k 2 )/P r (ki, k 2 ) reveals that most of the real networks are anticorrelated in 
comparison with the uncorrelated model. Therefore this ratio does not allow to distinguish 
between fractal and non-fractal networks. 

In search of uncovering the extent of anticorrelation that are needed to obtain fractals 
we study the ratios for different networks by using the WWW as a reference (the use of any 
other network as a reference would lead to the same conclusions). This is done in the main 
text in Figs. |2fc and|21i and for the model in Fig. Eb- These plots should be interpreted as 
follows: For instance, in Fig. Efc, let us take a large degree k\ = 100 as an example. Then we 
see that the ratio RE.coii(ki, k 2 ) / Pwww(ki, k 2 ) has a maximum for k 2 ~ 5 (red-yellow scale) 
for small k 2 (k 2 < 10), and a minimum (blue scale) for large k 2 > 10. This means that the 
metabolic network has less probability to have hub-hub connection (two nodes with large 
degree connected) than a hub-non hub connection, when compared to the WWW. Therefore 
the metabolic network of E.coli is more anticorrelated than the WWW. In the same way, 
Fig. Eli shows that the hubs in the Internet have more probability to connect with other 
hubs than in the WWW, and therefore the Internet is less anticorrelated than the WWW. 
Therefore these patterns reveal that the fractal cellular networks are strongly anticorrelated 
(dissortative). 

The same analysis is performed for the model in Fig. Eb in the main text. For instance, 
in this figure, given a large degree ki = 300, the ratio R e =i(ki, k 2 ) / R e=0 . 8 (ki, k 2 ) is small 
(blue/green region) for small k 2 (k 2 < 10) but large (red/yellow region) for large k 2 (k 2 > 10). 
This means that the network with e = 1 is more likely to have hub-hub connections than 
the e = 0.8 case. Thus, the profile shows how e = 0.8 is more anticorrelated than Mode I. 

In summary, using a short notation, the strength of hubs repulsion satisfies: Uncorrelated 
scale-free model < Internet < WWW < protein interaction < metabolic networks. For the 
model we have: Mode I < Model with intermediate e < Mode II. 

It is important to note that we can investigate the ratio between different networks such 
as RE.coii{ki, k 2 )/ Rwww(ki, k 2 ) because P(ki,k 2 ) shows a power law behavior. Thus, even 
though the WWW and the metabolic network have different ranges of the values of k, power 
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law scaling of P(ki, h-z) implies that the ratio is independent on the region of k± and k% used 
to plot this quantity. 



III. INVARIANCE OF DEGREE DISTRIBUTION 

The renormalization procedure gives rise to a series of coarse-grained networks based on 
the box length £b- The statistical properties of these networks, in particular, the degree 
distribution keeps invariant, as we showed in the previous work 4|. In this section, we verify 
this property again for a wide range of real-world networks including both fractal networks 
(WWW, protein interaction network of yeast and metabolic network of E. coli) and non- 
fractal networks (Internet), as we show in Fig. El It is important to note that even though 
the Internet is not fractal, the degree distribution is still invariant under renormalization. 



IV. THEORY 

Here we elaborate on several theoretical expressions presented in the main text. We fully 
develop the theoretical framework of renormalization and its analogy with the time evolution 
of networks. 

The multiplicative growth law is expressed as: 

L{t + 1) + L = a(L(t) + L ), 

N(t + 1) = nNit), 
1 , (5) 

k(t + l) = sk(t), 

h h (t + l) = ek(t), 

where all the quantities have been previously defined in the main text. In Fig. ^ of the 
main text we provide an example of these quantities in a hypothetical growth process. In 
this example N(t) = 16 nodes are renormalized with Nb{£b) — 4 boxes of size £b = 3 so 
that N(t — 1) = 4 nodes existed in the previous time step. The box size is defined as the 
maximum chemical distance in the box plus one. The chemical distance is the number of 
links of the minimum path between two nodes. The central box has a hub with khub — 8 
links, then k(t) = 8. After renormalization, ksi^B — 3) = 3 for this central box, so that 
kit — 1) = 3. Out these three links, two are via a hub-hub connection (Mode I), thus 
"-h(^s) = 2 and £(£b) = 2/3, for this case. 
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FIG. 6: Test of the invariance of degree distribution under renormalization. We plot the degree 
distribution of (a) WWW j^]]], (b) Protein interaction network of yeast, the inward degree (c) and 
the outward degree (d) of metabolic network of E. coli and (e) Internet [2^, according to 
different box size is- 
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TABLE I: Relation between time evolution and renormalization. 



Quantity Time evolution Renormalization 



diameter 


L{h)+L 


L/(£ B + L ) 




T.(f n ) -L T, n 


L 


number of 


N(h) 


N b (£b) 


nor! ps 

AJ.VJ LLV_>0 


N(to) 


N 


degree 


Hh) 


k(£ B ) 




Hh) 


khub 


hub- hub 


Hh) 


k(£ B ) 


links 


nh(h\h) 


nh{&B) 


We obtain 


the relation between the quantities at two times t 2 > t\ as 




L(t 2 ) + L = a* 2 "' 


'(L(h) + L ), 




N(h) = 
Hh) 


= n* 2_tl iV(ti), 

(6) 

= ^-' 1 A;(t 1 ), 




n h {t 2 \tx) 


= e t2 - h k{h)- 


Notice that the quantity rih(t 2 \ti) represents 


a special case. This quantity indicates the 



number of links at time t 2l which are connected to hubs generated before time h. To avoid 
the confusion with the other quantities, we introduce a new notation nh{h\ti) instead of 
hh(t 2 ) as used for the other quantities in Eq. (JHJ). We also notice that the notation hh{t) 
in the main text Eq. (jSJ) is then interpreted as nh(t\t — 1) for short. We then obtain: 
nh{h\h) = e n,h(t 2 — l\tx) = . . . = e* 2- * 1 nh{h\ti) = e' 2- ' 1 k(ti), where we have used that 
n h {h\h) = k(h). 

The relationship between the quantities describing the time evolution and the renormal- 
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ization is shown in Table |U They are formalized as follows: 



i B + L = {L{t 2 ) + L )/(Z(ti) + Lo) = a t2 - h 

M{£ B ) = N B (£ B )/N = N(ti)/N(t 2 ) = n^, 

t , (7) 
S(l B ) = k B (£ B )/k hub = k B {h)/k B {t 2 ) = s h ~ t2 , 

£{£ B ) = n h (£ B )/k B (£ B ) = Mhlt^/Hh) = e' 2 "\ 

Here we define the additional ratios, M and S. Replacing the time interval t 2 — t\ by 
ln(£ B + L )/ In a, as obtained from the first equation in (J7|), we obtain: 

M(t B ) = (£ B + L y^ n l^\ 

S(£ B ) = (£ B + L y 1 ^ 1 ™, (8) 
£(£ B ) = (£ B + L )- 1 ^ 1 ™, 

or 

N{£ B ) = (£ B + L )~ dB ,d B = \nn/\na, 
S(£ B ) = (£ B + L o y d *,d k = Ins/ In a, (9) 
S(£b) = (£ B + L )- d %d e = \n(l/e)/\na, 

which correspond to the equations described in the main text. Notice that we have considered 
L = in Eqs. (0) for simplicity. Equations (jHJ) are more general and accommodate the 
case of non-fractal networks which are characterized by exponential functions: 

Af(£ B ) ~exp(-£ B /£ ), 

(10) 

S(£ B )~exp(-£ B /£' ). 

These expressions arise from Eqs. © by taking the limit of d B — > oo, c4 — > oo, and 
Lq — > oo while Lo/d B — *> £q and Lo/dk — > ^q, where £o and are characteristic constants of 
the network. 

A. The minimal model 

In the framework of the minimal model, we start with a star structure at t = as 
seen in Fig [T^i. At each time step mk{t) new nodes are generated for each node with 
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(b) Mode I (c) Mode II 




(d) 

FIG. 7: Different modes of growth with m = 2. Starting with (a) five nodes at t = 0, the different 
connectivity modes lead to different topological structures, which are (b) Mode I, (c) Mode II and 
(d) combination of Mode I and II with probability e = 0.5. 

degree k(t), where m is an input parameter (m = 2 in Fig. |7j). Accordingly, we have 
N(t + 1) = N(t) + 2mK(t), where K(t) is the total number of links at time t. Since we 
do not consider the loop structure at the moment, we have K(t) = N(t). Then we obtain 
N(t + 1) = (2m + l)N(t), or n = 2m + 1. We find that the results of the model are 
independent on the initial configuration. 

Then, two different connectivity modes are chosen as follows: Mode I, we keep all the old 
connections generated multiplicatively at time t (the red links in Fig. UJp)- Mode II, all the 
old connections generated in the previous time time step are replaced by links between new 
generated nodes (see the green links in Fig. Ujp). 



27 




k 



FIG. 8: Predictions of the model for e = 0.5 for the degree distribution showing the power law 
behavior with 7 = 3 and its invariance under time evolution. 

Mode I implies s = m + 1, since k(t + 1) = mk(t) + k(t), where the term mk comes 
from newly generated nodes, and the term k comes from the links at previous time steps. 
The diameter of the network grows additively as: L(t + 1) = L(t) + 2 (a = 1, Lq — *■ 00 
and (a — 1)L — > 2 in Eqs. (J5J)) because at each step we generate one extra node at 
both sides of the network and therefore the size of the network is increase by 2, as seen 
in Fig. Eb- This implies L(t) ~ 2t. For this mode we obtain a non-fractal topology: 
Nb{£b)/N ~ exp(— ^£b) and k{lB)jkhub ~ exp(— a direct consequence of the 
linear growth of the diameter Lit) which implies that the network is small- world. Moreover, 
a = 1 leads to ds = Inn/ In a — > 00 in this case (non- fractal). 

Mode II alone gives rise to a fractal topology but with a breakdown of the small-world 
property. The diameter increases multiplicatively L(t + 1) = 3L(t) (a = 3 and Lq = in 
Eqs. (jnj)), because we replace all the links at previous time step by the paths with chemical 
distance 3. The degrees grow as k(t + 1) = mk(t) according to our generation protocol, 
which leads to s = m. The multiplicative nature of L(t) leads to an exponential growth in 
the diameter with time, L(t) ~ e* ln3 , and consequently to a fractal topology with finite ds 
and dk according to Eqs. This is seen because for this mode we have a = 3. We 

obtain Nb{£b) ~ £~b b with finite ds = ln(2m + l)/ln3 and also k(lB)/kh u b ~ ^ h with 
dk = In m/ In 3. However, the multiplicative growth of L leads to the disappearance of the 
small-world effect, which is replaced by a power-law dependence. 
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The general growth process is a stochastic combination of Mode I (with probability e) 
and Mode II (with probability 1 -e, see Fig. EH). We obtain L(t + 1) = (3-2e)L(t) + 2e, and 
a = 3 — 2e and L = e/(l — e). Then, when e — > 1 (Mode I) a = 1 and <ie - ► oo, -^o ~~ * 00 
and we obtain a non-fractal topology. On the other hand, Mode II has e = 0, then L Q = 
and a > 1 and is finite, following the fractal scaling. For an intermediate < e < 1 this 
model predicts finite fractal exponents ds and df. and also predicts the small- world effect 
due to the presence of Mode I, as shown in the main text. This is seen in the exponential 
behavior of (M c ) versus £b- 

In Fig. IHlwe show further evidence that this model reproduces the self-similar properties 
found in fractal networks by plotting P(k). We find that the model is modular since P(k) is 
invariant under renormalization with 7 = 1 + Inn/ In s = 3, which is in agreement with the 
empirical findings of Fig. El Consistent with our predictions we find that dk = In s/ In a = 
3.3. 

Finally we summarize the predictions of the model. The fractal dimension is ds = 
Inn/ In a, and in the framework of the minimal model with probability e, we find a = 3 — 2e 
and Lq = e/(l — e). Then, when e — * 1 (Mode I) a = 1 and ds — > 00, L$ — > 00 and 
£0 = L /ds = 2/ Inn giving a non-fractal topology as in Eqs. (jlOj) . On the other hand, 
Mode II has e = 0, then Lq = and a > 1 and ds is finite, following the fractal scaling of 
Eqs. @, as long as the growth of the number of nodes is multiplicative with a well-defined 
value of n. The fact that a = 1 implies a linear growth of the diameter L(t) ~ 2t, which 
produces the small-world property. An intermediate model with, for instance e = 0.8 gives 
rise to a fractal network with the small world effect, as shown by the scaling of Nb(£b) in 
Fig. Efc and £{£b) in Fig. Eli, in the main text. 



B. Additional supporting evidence for the fractal network model 

Evidence is given in Fig. for the minimal model with parameter e = 0.8. We cal- 
culate (i) the mean number of nodes (mass) of the boxes tiling the network, (Mb{£b)) 
(= N/Nb(£b)), by using the box covering methods, and (it) the local mass (M c (£ c )) by 
averaging over boxes of size £ c around a randomly chosen node (the cluster growing method, 
see |2j,|4(). The results show how the minimal model reproduces one of the main properties 
of fractal networks 4-]: the power-law relation for the global average mass (Mb(£b)) ~ £b b 
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with ds = In 5/ In 1.4 = 4.8 cLS cL SI gnature of fractality consistent with Eq. and the 
exponential dependence of the local mass (M c (£ c )} ~ e e ^ e ° as a signature of the small- world 
effect: £ c ~ ln(M c ). Note that the cluster growing method is actually a way to measure the 
distance, while the box covering method measures the fractality |4J. The model leads also 
to a smooth monotonic scaling in the size distribution of modules as observed in Q. The 
global small- world properties are treated next. 



V. GLOBAL SMALL WORLD: SHORT CUTS IN THE NETWORK 

An important factor in the dynamics of real- world networks is the existence of randomness 
or noise in the growth process. The simplest type of noise is the appearance of random con- 
nections between nodes as exemplified in the Watts-Strogatz model of small world networks 
jjj. To investigate how noise affects the fractality of networks, we modify the dynamical law 
of the model as follows: at each time step, pK(t) number of links are added in at random, 
here K (t) is the total number of links at time t, and p is a constant that controls the fraction 
of noise. We build a fractal, small-world and scale-free topology with parameters m = 1.5, 
e = 0.5, and add p = 1% random connections at each time step. Our analytical considera- 
tions predict a box dimension d% = 2 in the absence of noise. The numerical simulation (see 
Fig. IHb) shows that this prediction of ds still fits well to the simulated data, except for a 
small deviation at large box sizes, i.e. the added noise appears as an approximate exponen- 
tial tail at large distances. Interestingly, this method could be used to test the appearance 
of noise in real complex networks, or to asses the quality of the data in, for instance, protein 
interaction networks obtained by yeast two-hybrid methods which are known to suffer from 
many false positives. 

Most interestingly, the addition of noise leads to the small-world effect at the global 
level. In principle the existence of fractality seems to be at odds with the small-world 
effect. Fractality implies a power-law dependence on the distance, while the small-world 
effect implies an exponential dependence Q. In Fig. we show how the combination of 
Mode I and Mode II of growth leads to the global fractal property and the local small-world 
effect. In Fig. we show that by adding a small fraction of short-cuts in a fractal complex 
network, we reproduce also the small-world effect at the global level. Using the algorithm 
explained above we add noise to the system and we find that the average distance < d > 
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(a) 





(b) 



(c) 



FIG. 9: Additional predictions of the renormalization growth mechanism of complex networks, 
(a) The hallmark of fractality ,4j is predicted by the minimal mechanism: the global tiling of the 
network evidences the fractality in the power-law dependence of the mass of the boxes, (Mb), 
while the local average of the cluster growing method (M c ), evidences the small- world effect. We 
show simulations for e = 0.8 and different network size. The scaling of (Mb) does not show finite 
size effects. The initial exponential dependence range of (M c ) increases as the size of the network 
increases, (b) Nb vs £b for the model with e = 0.5 shows that the fractality still holds in the 
presence of random noise. The straight line gives the theoretical prediction of the model ds = 2. 
(c) Average of the shortest path between two nodes as a function of the system size showing the 
global small world for the model (e = 0.5). 
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over all pairs of vertices is 

< d >~ 2.61 In A^, (11) 

(Fig. Efc), indicating that the fractal model also predicts the global small- world. We notice 
that the fraction of short cuts needed to obtain the global small-world is very small, around 
1%. 

VI. RESILIENCE OF FRACTAL NETWORKS UNDER INTENTIONAL AT- 
TACK 

To compare the stability of fractal and non-fractal networks under intentional attack (by 
removing hubs one by one from the largest to the smallest one), we generate two networks 
with e = and e = 1. In general the threshold of collapse under attack depends on several 
parameters and not only on the correlated properties of the network. Since we wish to asses 
only the effects of anticorrelation for the vulnerability of the network, we set all the other 
parameters to be equal. Thus, we use the same N, < k >, 7, and also the same number 
of loops in the structure. For this purpose we consider the number of intraloops inside the 
boxes and the number of interloops between boxes. 

In practice, inside the box there are mk(t) newly generated nodes. We add ymk extra 
links between them to generate triangles (y is a given constant) to obtain loops inside the 
boxes. For this case, we can rewrite n = 2(1 + y)m + 1, and the clustering coefficient 
C(k) = (2ym/ s)k~ l . Thus, this kind of loops give rise to the known scaling of the clustering 
coefficient with k 

Another type of loops appears when more than one link connects two boxes (interloops). 
We find empirically that these kind of loops are also arranged in a self-similar way and are 
characterized by a new scaling exponent. In the framework of the minimal model, this type 
of loops can be introduced by adding x number of links between boxes at each time step, 
instead of keeping one link between boxes. These links could be of type Mode II (i.e., links 
between non-hubs), or otherwise could be between a hub from one box to a non-hub in the 
other node. In fact, this last mode of growth is a third mode that can be considered in the 
minimal model. We have not included it so far for simplicity, since it does not give rise to 
any new result. In general this mode could be thought of as a modified Mode II, and does 
not change the general conclusions of this study. 
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Combining the loop structure inside the boxes (intraloops characterized by y) and between 
boxes (interloops characterized by x) we obtain a general formula for the average degree 
< k >= 2(1 + y) + (x — l)/m. In the case of the minimal tree structure discussed in the 
main text we have y = and x — 1, which leads to < k >= 2, consistent with our previous 
arguments. These networks are then used to generate the structures used in the calculation 
of the vulnerability of networks under intentional attack, shown in Fig. |3J 
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